REMARKS 

Claims 1-14 are all the claims pending in the application. 

I. Objection to the Drawings 

A. The Examiner has objected to the drawings for the reasons set forth in item 2 on page 
2 of the Office Action. In particular, the Examiner has indicated that Figs. 1 and 2 should be 
identified as --Prior Art--. By this amendment, Applicants are submitting herewith replacement
sheets for Figs. 1 and 2 which include the --Prior Art-- legend. Accordingly, Applicants kindly
request that the above-noted objection be reconsidered and withdrawn. 

B. The Examiner has also objected to the drawings for the reasons set forth in item 3 on 
pages 2-3 of the Office Action. In particular, the Examiner has indicated that the specification 
does not refer to Figs. 3A, 3B, 5A-5C, 6A, 6B, 7A, 7B, 9A, 9B, 10A, 10B, 11A, 11B, 12A, 12B
and 13A-13C. In order to overcome this objection, Applicants have amended the specification so 
as to make reference to each of the above-noted drawings. 

In this regard, due to the number of changes made to the specification, Applicants are 
submitting herewith a substitute specification and abstract. Applicants note that various editorial
amendments have also been made for grammatical and general readability purposes. No new
matter has been added. Also enclosed is a marked-up copy of the original specification and
abstract showing the changes incorporated into the substitute specification and abstract. 

In view of the foregoing, Applicants kindly request that the Examiner reconsider and 
withdraw the above-noted objection. 

II. Claim Rejections under 35 U.S.C. § 101

A. Claims 1-8, 11 and 12 have been rejected under 35 U.S.C. § 101 as being directed to
non-statutory subject matter. In particular, the Examiner has indicated that in order for claims 1-8,
11 and 12 to be considered statutory, the claims must either (1) be tied to a "particular
apparatus" or (2) transform underlying subject matter to a different state or thing.

By this amendment, Applicants note that claims 1, 5, 11 and 12 have been amended such
that each of these claims is now tied to a "particular apparatus", thereby addressing the 
Examiner's comments in the Office Action. 

In view of the foregoing, Applicants respectfully submit that claims 1, 5, 11 and 12, as
well as dependent claims 2-4 and 6-8, are directed to statutory subject matter. Accordingly, 
Applicants kindly request that the above-noted rejection be reconsidered and withdrawn. 

B. Claims 11 and 12 have been rejected under 35 U.S.C. § 101 as being directed to
non-statutory subject matter. By this amendment, Applicants note that claims 11 and 12 have been
amended as suggested by the Examiner so as to indicate that the program is embodied on a 
computer-readable medium, thereby addressing this rejection. 

In view of the foregoing, Applicants respectfully submit that claims 11 and 12 are
directed to statutory subject matter and, therefore, kindly request that the above-noted rejection 
be reconsidered and withdrawn. 

III. Allowable Subject Matter

Applicants thank the Examiner for indicating that claims 9, 10, 13 and 14 are allowed. 
Regarding claims 1-8, 11 and 12, Applicants note that because all of the rejections of these 
claims have been addressed by the present amendment, it is respectfully submitted that all of the 
claims of the present application are now in condition for allowance, an indication of which is 
kindly requested. 

IV. Conclusion 

In view of the above, reconsideration and allowance of this application are now believed 
to be in order, and such actions are hereby solicited. If any points remain in issue, the Examiner
is kindly requested to contact the undersigned at the telephone number listed below. 

The Commissioner is authorized to charge any deficiency or to credit any overpayment 
associated with this communication to Deposit Account No. 23-0975. 

Respectfully submitted, 

Kiyofumi ABE et al. 

/Kenneth W. Fields/ 

By: 2009.07.24 12:35:27 -04'00'

Kenneth W. Fields 
Registration No. 52,430 
Attorney for Applicants 



KWF/krg 

Washington, D.C. 20005-1503 
Telephone (202) 721-8200 
Facsimile (202) 721-8250 
July 24, 2009 

Version with Markings to Show Changes Made

DESCRIPTION 

METHOD FOR ENCODING MOVING IMAGE AND METHOD FOR 
DECODING MOVING IMAGE 

Technical Field

This invention relates to a moving image encoding method for 
encoding a moving image signal after dividing the moving image 
signal composed of luminance components and chrominance 
components into blocks, and to a moving image decoding method for 
decoding the encoded data.

Background Art 

In the age of multimedia, which integrally handles audio, video
and other pixel values, existing information media, specifically
newspapers, magazines, television, radio, telephone and the like
through which information is conveyed to people, have recently
come to be included in the scope of multimedia. Generally,
multimedia refers to something that is represented by associating
not only characters but also graphics, sound, and especially images
and the like together. In order to include the aforementioned
existing information media in the scope of multimedia, it is a
prerequisite to represent such information in a digital form.

However, if the amount of information carried by each of the
mentioned information media is estimated as an amount of digital
information, the amount of information for 1 character in the
case of text is 1 to 2 bytes, whereas the amount of information
required for sound is 64 Kbits per second (telephone quality), and
100 Mbits or more per second is necessary for moving images
(current television receiving quality). It is therefore not realistic
for the information media to handle such an enormous amount of
information as it is in digital form. For example, although video
phones are already in actual use via Integrated Services Digital
Network (ISDN), which offers a transmission speed of 64 kbps to
1.5 Mbps, it is impossible to transmit images on televisions and
images taken by cameras directly through ISDN.

Accordingly, information compression techniques have
become required. For example, in the case of the video phone,
the H.261 and H.263 standards for moving image compression
technology, internationally standardized by the International
Telecommunication Union - Telecommunication Standardization
Sector (ITU-T), are being employed. Moreover, with MPEG-1
standard information compression techniques, it has also become
possible to store video information onto general music compact
discs (CD) together with audio information.

Here, Moving Picture Experts Group (MPEG) is an
international standard for digital compression of moving image
signals. MPEG-1 is a standard for compressing moving
image signals to 1.5 Mbps, in other words, compressing television
signals to approximately a hundredth of the original size.
Moreover, since target picture quality within the scope of the
MPEG-1 standard is limited to a medium degree of quality which can
be realized by a transmission speed of primarily about 1.5 Mbps, the
use of MPEG-2, which was standardized to satisfy demands for
further improved picture quality, realizes television broadcasting
quality with moving image signals compressed to 2 to 15 Mbps.

Furthermore, MPEG-4, which has a higher compression ratio, has
been standardized by the working group (ISO/IEC JTC1/SC29/WG11)
which pursued the standardization of MPEG-1 and MPEG-2.
MPEG-4 not only enables efficient coding at a low bit rate, but
also introduces a strong error tolerance technique which can reduce
subjective picture quality degradation even when a transmission line
error has occurred. In addition, currently, as a next generation
screen coding method, the standardization of H.264 has been pursued
through cooperation between ISO/IEC and ITU-T.



In general, in encoding of a moving image, the amount of
information is compressed by reducing redundancy in temporal and
spatial directions. Here, in inter prediction encoding for
reducing temporal redundancy, motion estimation and
generation of a prediction image are performed on a block-by-block
basis by referring to preceding and following images, and encoding
is performed on a difference between the obtained prediction image
and a block to be encoded. Further, in intra prediction encoding
for reducing spatial redundancy, a prediction image is generated
from pixel information of a neighboring encoded block, and the
encoding is performed on a difference between the obtained
prediction image and the block to be encoded.

Here, a picture is a term indicating one screen. It
indicates one frame when coded as a frame structure, and
one field when coded as a field structure.

Each picture is divided into blocks called macro blocks, each of
which is, for example, 16 horizontal x 16 vertical pixels, and is
processed on a block-by-block basis. A picture of the field
structure is encoded with all macro blocks treated as field macro
blocks. On the other hand, a picture of the frame structure can be
encoded not only with all macro blocks treated as frame macro blocks,
but also by switching between the frame structure and the field
structure in units of two vertically consecutive macro blocks
(a macro block pair).

FIG. 1 is a block diagram showing a structure of a moving
image encoding apparatus which realizes a conventional moving
image encoding method. The moving image encoding apparatus
includes a picture memory 101, a prediction residual encoding unit
102, a bit stream generation unit 103, a prediction residual decoding
unit 104, a deblocking unit 105, a picture memory 106, an encoding
mode controlling unit 107, an inter prediction image generation unit
108 and an intra prediction image generation unit 109.

The bit stream to be encoded is inputted to the picture memory 101
on a picture-by-picture basis in display order, and the pictures are
sorted into encoding order. Further, each of the pictures is divided
into macro blocks and the following processing is applied to each
macro block.

There are mainly two types of encoding methods: inter prediction
encoding and intra prediction encoding. Here, the inter prediction
encoding is explained first.

An input image signal read out from the picture memory 101
is inputted to a difference arithmetic unit 110, and a difference
image signal obtained by calculating a difference with the prediction
image signal that is an output of the inter prediction image
generation unit 108 is outputted to the prediction residual encoding
unit 102. The prediction residual encoding unit 102 performs
image encoding processing such as frequency conversion and
quantization so as to output a residual signal. The residual signal is
inputted to the prediction residual decoding unit 104, and
image decoding processing such as inverse quantization and inverse
frequency conversion is performed so as to output a residual
decoded signal. A sum arithmetic unit 111 adds the residual
decoded signal and the prediction image signal so as to generate a
reconstructed image signal. The reconstructed image signal is
processed by the deblocking processing unit 105 so as to reduce
distortion which occurs at a boundary between the blocks into which
the picture is divided for encoding, before being stored as a
reference picture into the picture memory 106.
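
The signal flow described above (difference, quantization, inverse quantization, sum) can be sketched roughly as follows. This is a simplified illustration only: frequency conversion and deblocking are omitted, the uniform quantizer with step `qstep` is an assumption, and the function name `encode_block` is hypothetical rather than taken from the specification.

```python
def encode_block(block, prediction, qstep=8):
    """Return (residual_signal, reconstructed_block) for one block.

    block and prediction are equally sized 2-D lists of pixel values.
    """
    # Difference arithmetic unit 110: input image minus prediction image.
    diff = [[b - p for b, p in zip(brow, prow)]
            for brow, prow in zip(block, prediction)]
    # Prediction residual encoding unit 102 (quantization only in this sketch).
    residual = [[round(d / qstep) for d in row] for row in diff]
    # Prediction residual decoding unit 104: inverse quantization.
    decoded = [[q * qstep for q in row] for row in residual]
    # Sum arithmetic unit 111: prediction plus residual decoded signal.
    recon = [[p + d for p, d in zip(prow, drow)]
             for prow, drow in zip(prediction, decoded)]
    return residual, recon
```

Because the encoder reconstructs from the quantized residual rather than from the original input, the reference picture it stores matches what a decoder would obtain from the same data.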

On the other hand, the input image signal read out from the picture
memory 101 on a macro block-by-block basis is also
inputted to the inter prediction image generation unit 108. Here,
targeting one or more encoded pictures stored in the picture memory
106, an image area which is closest to the input image signal is
detected and outputted as a prediction image. The prediction
image is used for generating a difference image signal in the
difference arithmetic unit 110 and for generating a reconstructed
image signal in the sum arithmetic unit 111.

The bit stream generation unit 103 performs variable length
encoding on various encoded information outputted by the series of
above processing so as to obtain a bit stream (moving image
encoded data) to be outputted by the encoding processing.

While this flow of processing is an operation in the case where
the inter prediction encoding is performed, it is switched to the intra
prediction encoding by the switch 112. Hereafter, the intra
prediction encoding is explained.

The input image signal read out from the picture memory 101
is inputted to the difference arithmetic unit 110, and the difference
image signal obtained by calculating a difference with the prediction
image signal that is an output of the intra prediction image
generation unit 109 is outputted to the prediction residual encoding
unit 102. The prediction residual encoding unit 102 performs
image encoding processing such as frequency conversion and
quantization so as to output a residual signal. The residual signal is
inputted to the prediction residual decoding unit 104, and the image
decoding processing such as inverse quantization and inverse
frequency conversion is performed so as to output the residual
decoded signal. The sum arithmetic unit 111 adds the residual
decoded signal and the prediction image signal, and generates a
reconstructed image signal. The reconstructed image signal is
processed by the deblocking processing unit 105 so as to reduce
distortion which occurs at a boundary between the blocks into which
the picture is divided for encoding.

On the other hand, the input image signal read out from the
picture memory 101 on a macro block-by-block basis is also inputted
to the intra prediction image generation unit 109. Here, a
prediction image is generated by referring to the reconstructed
image signal of one or more neighboring blocks in the same picture
generated as an output of the sum arithmetic unit 111. The
prediction image is used for generating a difference image signal in
the difference arithmetic unit 110 and for generating a
reconstructed image signal in the sum arithmetic unit 111.

The bit stream generation unit 103 performs variable length
encoding on various encoded information outputted by the series of
processing so that a bit stream outputted by the encoding
processing is obtained.

Each encoding mode of the inter prediction encoding and the
intra prediction encoding is controlled by the encoding mode control
unit 107 and switched on a macro block-by-block basis.

FIG. 2 is a block diagram showing a structure of a moving
image decoding apparatus which realizes a conventional moving
image decoding method. The moving image decoding apparatus
includes a bit stream analyzing unit 201, a prediction residual
decoding unit 202, a deblocking unit 203, a picture memory 204, a
decoding mode controlling unit 205, an inter prediction image
generation unit 206 and an intra prediction image generation unit
207.

First, the bit stream analyzing unit 201 extracts various
information from the inputted bit stream (moving image encoded
data), and the information relating to a decoding mode and the
residual encoded signal are respectively outputted to the decoding
mode controlling unit 205 and the prediction residual decoding unit
202.

There are two types of decoding methods: inter prediction
decoding and intra prediction decoding. Here, the inter prediction
decoding is explained first.

The prediction residual decoding unit 202 performs image
decoding processing such as inverse quantization and inverse
frequency conversion on the inputted residual encoded signal, and
outputs the residual decoded signal. The sum arithmetic unit 208
adds the residual decoded signal and the prediction image signal
outputted from the inter prediction image generation unit 206, and
generates a decoded image signal. The deblocking unit 203
performs processing on the decoded image signal for reducing
distortion which occurs at a boundary between blocks, before the
signal is stored into the picture memory 204 as a picture for
reference or display.

On the other hand, the inter prediction image generation unit
206 takes out a specified image area from one or more decoded
pictures stored in the picture memory 204, and generates a
prediction image. The prediction image is used for generating a
decoded image signal by the sum arithmetic unit 208.

The decoded image generated by the series of processing is
outputted as an image signal for display from the picture memory
204 according to the timing to be displayed.

While the flow of processing is an operation in the case where
the inter prediction decoding is performed, it is switched to the intra
prediction decoding by the switch 209. Hereafter, the intra
prediction decoding is explained.

The prediction residual decoding unit 202 performs image
decoding processing such as inverse quantization and inverse
frequency conversion on the inputted residual encoded signal, and
outputs a residual decoded signal. The sum arithmetic unit 208
adds the residual decoded signal and the prediction image signal
outputted from the intra prediction image generation unit 207, and
generates a decoded image signal. The deblocking processing unit
203 performs processing on the decoded image signal for reducing
distortion which occurs at a boundary between blocks, before the
signal is stored in the picture memory 204 as a picture for display.

On the other hand, the intra prediction image generation unit
207 generates a prediction image by referring to a decoded image
signal of one or more neighboring blocks in the same picture
generated as an output of the sum arithmetic unit 208. The
prediction image is used for generating the decoded image signal in
the sum arithmetic unit 208.

The decoded image generated by the series of processing is
outputted as an image signal for display from the picture memory
204 according to the timing to be displayed.

Note that each of the decoding modes of the inter prediction
decoding and the intra prediction decoding is controlled by the
decoding mode controlling unit 205 and switched on a macro
block-by-block basis.

Next, the processing in the deblocking units 105 and 203 is
explained in detail. Here, the processing details in the
encoding processing and the processing details in the decoding
processing are precisely identical to each other. Therefore, they
are explained together as the same processing.

FIGS. 3A and 3B are drawings for explaining a method of
determining types of filters used for deblocking. Here, as an
example, assuming that there are five types of filters, the filters are
used by being switched according to a characteristic of a block
boundary. A stronger filter (here, Filter 4) is applied to a portion
where block distortion is more likely to occur prominently, and a
weaker filter (here, Filter 0) is applied to a portion where block
distortion is less likely to occur prominently.

FIG. 3A is a drawing showing a boundary between blocks to
which filters are applied. In the drawing, the center line indicates a
boundary between blocks; the pixel on the right side shown as Q
indicates a pixel adjacent to the boundary in the target block; and
the pixel on the left side shown as P indicates a pixel adjacent to the
boundary in the adjacent block. FIG. 3B is a table showing which
filter is selected according to the conditions satisfied by the pixel P
and the pixel Q shown in FIG. 3A. For example, Filter 4 is selected
in the case where the boundary is at a vertical edge and one of the
pixels P and Q belongs to a block which is intra prediction encoded.
Similarly, Filter 3 is selected in the case where the boundary is
at a horizontal edge and one of the pixels P and Q belongs to a block
which is intra prediction encoded. Also, Filter 2 is selected in
the case where one of the pixels P and Q belongs to a block which
has a coefficient other than 0 of a spatial frequency component
converted by frequency conversion. Further, Filter 1 is selected
in the case where the pixels P and Q belong to blocks which are inter
prediction encoded and refer to respectively different pictures or
different motion vectors. Furthermore, Filter 0 is selected in
the case where none of the above conditions applies.

Here, the table of FIG. 3B shows an example of a method of
selecting filters. The number of filters and the selection conditions
are not limited to the example. Therefore, the other cases can be
similarly treated.
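
The example selection rule described for FIG. 3B can be sketched as below. This is an illustrative reading of the conditions as stated in the text, not a definitive implementation; the `BlockInfo` record, its field names and the function name `select_filter` are hypothetical, and the conditions are assumed to be tested in order from the strongest filter to the weakest.

```python
from dataclasses import dataclass


@dataclass
class BlockInfo:
    """Hypothetical per-block encoding information for pixels P and Q."""
    intra: bool = False            # block is intra prediction encoded
    has_coeff: bool = False        # block has a non-zero frequency coefficient
    ref_picture: int = 0           # referenced picture (inter blocks only)
    motion_vector: tuple = (0, 0)  # motion vector (inter blocks only)


def select_filter(p, q, vertical_edge):
    """Return the filter number (0 = weakest, 4 = strongest)."""
    if p.intra or q.intra:
        # Filter 4 at a vertical edge, Filter 3 at a horizontal edge.
        return 4 if vertical_edge else 3
    if p.has_coeff or q.has_coeff:
        return 2
    if p.ref_picture != q.ref_picture or p.motion_vector != q.motion_vector:
        return 1
    return 0
```

For instance, `select_filter(BlockInfo(intra=True), BlockInfo(), vertical_edge=True)` yields Filter 4, while two inter blocks with identical reference pictures and motion vectors and no coefficients yield Filter 0.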

Next, a flow of deblocking processing is explained with
reference to a flowchart shown in FIG. 4. The target data is
managed separately as data for luminance and data for
chrominance. Therefore, deblocking is separately applied to each
component.

First, in order to perform deblocking on the luminance
components, loop processing is repeated as many times as the number
of pixels of the luminance component adjacent to a target block
boundary (F1 and F4); a type of filter explained with reference to
FIGS. 3A and 3B is selected in each loop (F2); and the filter is
applied (F3). The information on the type of the selected filter is
used for applying filtering to the target pixel of the luminance
component, and is also stored in a memory region from which the
information can be referred to in later processing (F5). Since, for
each pixel, a boundary at a vertical edge on the left side and a
boundary at a horizontal edge on the upper side are targeted, the
above mentioned processing is applied eight times in the case of a
block which is, for example, made up of 4 horizontal x 4 vertical
pixels.

Next, in order to perform deblocking on the chrominance
components, loop processing is repeated as many times as the number
of pixels of the chrominance components adjacent to the target block
boundary (F6 and F10); a type of filter is selected in each loop (F8);
and the selected filter is applied (F9). Here, the filter to be applied
is determined according to the type of the filter used for the
luminance component. Specifically, the type of the filter applied at
the position of the pixel of the corresponding luminance component is
read out from the memory region in which the information about the
type of filter determined in the processing for the luminance
component is stored, and is used. Here, the following equations
are used for converting the position of a target pixel of the
chrominance component to the position of the corresponding pixel of
the luminance component (F7). Note that XL indicates a horizontal
coordinate value of the luminance, XC indicates a horizontal
coordinate value of the chrominance, YL indicates a vertical
coordinate value of the luminance, and YC indicates a vertical
coordinate value of the chrominance.
XL = 2 × XC (equation 1(a))
YL = 2 × YC (equation 1(b))

Thus, deblocking is performed on the chrominance
component by applying a filter determined by the above mentioned
processing.
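
As a rough sketch of steps F7 and F8, the conversion of equations 1(a) and 1(b) and the subsequent lookup of the stored filter type might look as follows. The function names and the layout of `luma_filter_types` (a 2-D list indexed as `[YL][XL]` that holds the filter type chosen for each luminance pixel) are assumptions made for illustration.

```python
def luma_position(xc, yc):
    """Equations 1(a) and 1(b): XL = 2 * XC, YL = 2 * YC."""
    return 2 * xc, 2 * yc


def chroma_filter_type(luma_filter_types, xc, yc):
    """Return the filter type stored for the corresponding luminance pixel."""
    xl, yl = luma_position(xc, yc)
    return luma_filter_types[yl][xl]
```

With this conversion, the chrominance pixel at (XC, YC) = (1, 1) reuses the filter type that was selected for the luminance pixel at (XL, YL) = (2, 2).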

Next, a relationship between the luminance components and the
chrominance components is explained. FIGS.
5A-5C are drawings for explaining positional relationships
between the luminance components and the chrominance
components. In the drawings, an x mark indicates a sample position
of the luminance component and an O mark indicates a sample position
of the chrominance component.

In general, human eyes are insensitive to changes in the
chrominance components. Therefore, the chrominance components
are often decimated for use. While there are various
decimation methods, FIG. 5A indicates a positional
relationship in the case where the chrominance components are
decimated to a half amount in both vertical and horizontal directions.
FIG. 5B indicates a positional relationship in the case where the
chrominance components are decimated to a half amount only in the
horizontal direction. FIG. 5C indicates a positional relationship in
the case where decimation is not performed. In the case of the
positional relationship as shown in FIG. 5A, the equation 1(a) and
the equation 1(b) are used for calculating a pixel position of the
corresponding luminance component for deblocking the
chrominance component.

Further, FIGS. 6A-6C show a positional
relationship in a frame structure and in a field structure in the case
where the chrominance components are decimated to a half amount
in both vertical and horizontal directions. FIG. 6A shows a frame
structure when processing is performed after decimating the
chrominance components. FIG. 6B shows a field structure into which
the frame structure is rearranged. Specifically, zeroth, second and
fourth lines of the luminance components are assigned to a top field
and first, third, and fifth lines are assigned to a bottom field. (Refer
to: ITU-T Rec. H.264 | ISO/IEC 14496-10 AVC Draft Text of Final
Draft International Standard (FDIS) of Joint Video Specification
(2003-3-31)).

However, in the conventional structure, the type of a filter
used for the luminance component at a pixel position converted
using the equation 1(a) and the equation 1(b) is applied to a pixel of
the chrominance component. Therefore, in the case where an image
to be displayed in an interlaced-scan form is encoded and decoded in
the frame structure, there was a problem of inconsistency in that a
filter to be applied to a chrominance component belonging to the
bottom field is determined by referring to a luminance component
in the top field.
FIGS. 7A and 7B are drawings for explaining the reference
relationship therein. FIG. 7A shows a positional relationship
between the luminance components and the chrominance
components when the picture is encoded and decoded in the frame
structure. FIG. 7B shows a positional relationship between the
luminance components and the chrominance components when the
image is rearranged into the field structure. Here, L_0 indicates a
position of the luminance components at the zeroth line, and C_0
indicates a position of the chrominance components at the zeroth
line. According to the equation 1(b), the luminance component at L_2
is referred to when a deblocking filter is applied to the chrominance
component at C_1. However, when the pixels are rearranged into the
field structure, it is found that the type of filter is determined with
the chrominance component at C_1 in the bottom field referring to
the luminance component at L_2 in the top field.
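
The mismatch can be checked with a short sketch. It assumes, as an illustration only, that chrominance lines are assigned to fields by the same parity rule stated above for luminance lines (even lines to the top field, odd lines to the bottom field), which agrees with the text placing C_1 in the bottom field; the function names are hypothetical.

```python
def field_of(line):
    """Even-numbered lines belong to the top field, odd ones to the bottom."""
    return "top" if line % 2 == 0 else "bottom"


def conventional_reference(yc):
    """Return (chroma field, referenced luma line, luma field) for chroma line yc."""
    yl = 2 * yc  # equation 1(b)
    return field_of(yc), yl, field_of(yl)
```

Since 2 × YC is always even, the luminance line referred to under equation 1(b) always lies in the top field, so every chrominance line in the bottom field (C_1, C_3, and so on) takes its filter type from the other field.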

As described above, in a picture having the frame structure,
all macro blocks can be encoded not only as frame macro blocks but
also by switching between the frame structure and the field structure
on a macro block pair-by-pair basis. Also, in the case where the
picture is encoded in the field structure, it is possible to use separate
encoding modes for the top field and the bottom field.

Accordingly, for example, in the case where an intra
prediction encoding mode is used in the top field and an inter
prediction encoding mode is used in the bottom field, picture quality
is degraded in the chrominance components in the bottom field. In
other words, basically, a strong filter is applied for the intra
prediction encoding mode and a weak filter is applied for the inter
prediction encoding mode. Therefore, a weak filter
should properly be applied to the chrominance components in the
bottom field. However, as described above, the type of filter for the
chrominance components in the bottom field is determined by
referring to the luminance components in the top field. Therefore,
a strong filter is applied. Consequently, the picture quality is
degraded in the chrominance components in the bottom field so that
the image is not consistent when it is displayed in the
interlaced-scan form.

Further, the same applies to the case where,
even if the same encoding mode is used for the top field and the
bottom field of the target macro block, an adjacent macro block is
encoded in the field structure and separate encoding modes are
used for its top field and bottom field.

As described above, in the case where the image to be
displayed in the interlaced-scan form is encoded and decoded in the
frame structure, there is a problem that an inappropriate type of
filter is applied, because the type of filter applied to the
chrominance components may be determined by referring
to the luminance components in a different field.

Disclosure of Invention 

In order to solve the conventional problems, it is an object of
the present invention to provide a moving image encoding method
and a moving image decoding method which enable the most
appropriate type of filter to be applied for deblocking even in the
case where an image to be displayed in an interlaced-scan form is
encoded and decoded.

In order to achieve the aforementioned object, a moving
image encoding method according to the present invention is a
moving image encoding method for encoding an inputted picture on
a block-by-block basis after dividing the picture into blocks, the
picture including luminance components and chrominance
components, the method including: a prediction encoding step of
encoding the picture by referring to an encoded area in the picture
or a different encoded picture; a reconstructed picture generation
step of reconstructing the picture using encoded data generated in
the prediction encoding step; and a deblocking step of reducing
distortion by filtering a boundary between blocks of the
reconstructed picture generated in the reconstructed picture
generation step, wherein in the deblocking step, filters are
respectively applied to the luminance components and the
chrominance components: a type of the filters is selected based on
encoding information of each luminance component and the
selected filter is applied to the luminance component; and a filter
selected for a luminance component, which belongs to a same field
as a chrominance component and corresponds to the chrominance
component, is applied to the chrominance component.

Accordingly, the same type of deblocking filter is applied to
the luminance components and chrominance components in the
same field. Therefore, encoding can be performed so as to generate
a consistent image when the image is displayed in the
interlaced-scan form after being decoded. Further, it is possible to
combine a structure of the present invention easily into the
conventional structure because there are very few changes from the
conventional structure.

Also, a moving image decoding method according to the
present invention is a moving image decoding method for decoding
encoded moving image data obtained by encoding, on a
block-by-block basis, a picture which is divided into blocks, the
picture including luminance components and chrominance
components, the method including: a prediction decoding step of
decoding the picture by referring to a decoded area in the picture or
a different decoded picture; and a deblocking step of reducing
distortion by filtering a boundary between blocks of the decoded
picture generated in said prediction decoding step, wherein in the
deblocking step, filters are respectively applied to the luminance
components and the chrominance components: a type of the filters
is selected based on encoding information of the luminance
component and the selected filter is applied to each luminance
component; and a filter selected for a luminance component, which
belongs to a same field as a chrominance component and
corresponds to the chrominance component, is applied to the
chrominance component.

Accordingly, the same type of deblocking filter is applied to the
luminance components and chrominance components in the same
field. Further, it is possible to combine a structure of the present
invention easily into the conventional structure because there are
very few changes from the conventional structure.

Note that the present invention can be realized not only as
such a moving image encoding method and moving image decoding
method, but also as a moving image encoding apparatus and a
moving image decoding apparatus having, as units, the
characteristic steps included in the moving image encoding method
and the moving image decoding method. Further, it can be realized
as a program for causing a computer to execute those steps and as
moving image data encoded by the moving image encoding method.
Furthermore, such a program and the encoded moving image data
can be distributed via a recording medium such as a CD-ROM and a
transmission medium such as the Internet.

According to the moving image encoding method of the
present invention, the same type of deblocking filter is applied to
the luminance components and the chrominance components in the
same field. Therefore, it is possible to perform encoding so as to
generate a consistent image when the image is displayed in the
interlaced-scan form after the image is decoded. Further, it is
possible to combine a structure of the present invention easily into
the conventional structure because there are few changes from the
conventional structure.

In addition, according to the moving image decoding method
of the present invention, the same type of deblocking filter is applied
to the luminance components and chrominance components in the
same field. Therefore, it is possible to generate a consistent image
in the case of displaying the image in the interlaced-scan form.
Further, it is possible to combine a structure of the present invention
easily into the conventional structure because there are very few
changes from the conventional structure.

Brief Description of Drawings

FIG. 1 is a block diagram showing a structure of a 
conventional moving image encoding apparatus. 

FIG. 2 is a block diagram showing a structure of a 
conventional moving image decoding apparatus. 
FIGS. 3A and 3B are schematic drawings showing a method
of determining types of deblocking filters.

FIG. 4 is a flowchart for explaining a flow of deblocking
processing in the conventional moving image encoding apparatus
and the moving image decoding apparatus.

FIGS. 5A-5C are schematic drawings showing a relationship
of sample positions of chrominance components and luminance
components.

FIGS. 6A and 6B are other schematic drawings showing a
relationship of sample positions of chrominance components and
luminance components.

FIGS. 7A and 7B are schematic drawings showing a
relationship when the pixel positions of the chrominance
components are converted to the pixel positions of the luminance
components.

FIG. 8 is a flowchart for explaining a flow of deblocking
processing in the moving image encoding apparatus and the moving
image decoding apparatus in a first embodiment.

FIGS. 9A and 9B are other schematic drawings showing a
relationship when the pixel positions of the chrominance
components are converted to the pixel positions of the luminance
components.

FIGS. 10A and 10B are flowcharts for explaining flows of
other deblocking processing in the moving image encoding
apparatus and the moving image decoding apparatus in the first
embodiment.

FIGS. 11A and 11B are other schematic drawings showing
a relationship when the pixel positions of the chrominance
components are converted to the pixel positions of the luminance
components.

FIGS. 12A and 12B are other schematic drawings showing
a relationship when the pixel positions of the chrominance
components are converted to the pixel positions of the luminance
components.

FIGS. 13A-13C are drawings for explaining a
recording medium for storing a program for causing a computer
system to execute the moving image encoding method and the
moving image decoding method in each embodiment: FIG. 13A is an
illustration showing an example of a physical format of a flexible disk
which is a recording medium body; FIG. 13B is an illustration
showing an outer view seen from the front of the flexible disk, a
cross-section structure, and the flexible disk itself; and FIG. 13C is
an illustration showing a structure for recording and reproducing the
program on the flexible disk FD.

FIG. 14 is a block diagram showing an overall structure of a
content supply system for realizing content distribution service.

FIG. 15 is a schematic diagram showing an example of a cell
phone.

FIG. 16 is a block diagram showing an internal structure of the
cell phone.

FIG. 17 is a block diagram showing an overall structure of a
system for digital broadcast.

Best Mode for Carrying Out the Invention 

Hereafter, specific embodiments of the present invention are
explained with reference to the drawings.

(First embodiment)

The structure of a moving image encoding apparatus
according to a flow of encoding processing as a whole in a first
embodiment of the present invention is completely the same as the
conventional structure explained using FIG. 1. Therefore, an
explanation of the same structure is omitted here. The
structure of the moving image encoding apparatus differs from
the conventional structure only in the deblocking method used by
the deblocking unit 105 shown in FIG. 1. Also, the structure of the
moving image decoding apparatus according to a flow of decoding
processing as a whole is completely the same as the conventional
structure explained using FIG. 2. Therefore, the explanation is
omitted here. It differs from the conventional structure only in the
deblocking method used by the deblocking unit 203 shown in FIG. 2.

Hereafter, the processing in the deblocking units 105 and 203
is explained in detail. Here, the processing details in the
encoding processing and in the decoding processing are completely
the same, so they are explained together.

The deblocking processing differs depending on the method of
decimating chrominance components in the structure of the moving
image data. Here, three types of decimation methods are
respectively explained: (1) the case where, as shown in FIG. 5A,
moving image data in which the chrominance components are
decimated to a half amount in both vertical and horizontal
directions is inputted; (2) the case where, as shown in FIG. 5B,
moving image data in which the chrominance components are
decimated to a half amount only in the horizontal direction is
inputted; and (3) the case where, as shown in FIG. 5C, moving
image data in which the chrominance components are not decimated
is inputted. Note that, in the following explanation, it is assumed
that a target block is encoded and decoded as a frame structure.

(1) half decimation in vertical and horizontal directions
FIG. 8 is a flowchart for explaining a flow of deblocking
processing in the case where a moving image in which the
chrominance components are decimated to a half amount in both
vertical and horizontal directions is inputted. The target moving
image data is managed by being divided into luminance components
and chrominance components. Therefore, deblocking is
independently performed on the respective components.

First, in order to perform deblocking on the luminance
components, a loop processing is repeated as many times as the
number of pixels of the luminance components adjacent to the
target block boundary (F1 and F4), a type of filter is selected using
encoding information of a pixel of a target luminance component in
each loop (F2), and the selected filter is applied (F3). The
information about the type of filter selected herein is stored in a
memory region in which the information can be referred to in later
processing, while being used for applying filtering to the pixel of the
target luminance component (F5). Since a boundary formed of a
left vertical edge and a boundary formed of an upper horizontal edge
are targets for each block, the processing is repeated eight times in
the case of a block, for example, made up of horizontal 4 × vertical
4 pixels.
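
The luminance loop described above can be sketched as follows. This is only an illustrative sketch, not the implementation of the deblocking units 105 and 203: the names boundary_pixels, deblock_luma and filter_memory are hypothetical, the filter selection and filtering operations are passed in as placeholder callables because the selection table of FIG. 3B is not reproduced here, and keying the stored filter type by edge direction as well as position is an assumption made so that the corner pixel can hold one entry per edge.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical key: (edge, x, y); edge is "V" (left vertical) or "H" (upper horizontal).
BoundaryPixel = Tuple[str, int, int]


def boundary_pixels(x0: int, y0: int, width: int, height: int) -> List[BoundaryPixel]:
    """Luminance pixels adjacent to the left and upper boundaries of a block."""
    left = [("V", x0, y0 + dy) for dy in range(height)]
    top = [("H", x0 + dx, y0) for dx in range(width)]
    return left + top


def deblock_luma(
    block: Tuple[int, int, int, int],
    select_filter: Callable[[BoundaryPixel], int],
    apply_filter: Callable[[BoundaryPixel, int], None],
    filter_memory: Dict[BoundaryPixel, int],
) -> int:
    """Run the luminance loop once per boundary pixel and return the loop count."""
    count = 0
    for pixel in boundary_pixels(*block):    # loop delimited by F1 and F4
        filter_type = select_filter(pixel)   # F2: select a filter type
        apply_filter(pixel, filter_type)     # F3: apply the selected filter
        filter_memory[pixel] = filter_type   # F5: keep the type for later reference
        count += 1
    return count
```

For a block of horizontal 4 × vertical 4 pixels the loop runs eight times, four for the left vertical edge and four for the upper horizontal edge, which matches the count given above.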

A method for determining types of filters is explained
with reference to FIGS. 3A and 3B. Here, assuming that there are
five types of filters as an example, the filters are used by switching
them depending on a characteristic of a block boundary. The
structure is such that a stronger filter (here indicated as Filter 4) is
applied to a portion where there is a higher possibility that block
distortion is prominently generated and that a weaker filter (here
indicated as Filter 0) is applied to a portion where there is a lower
possibility that block distortion is prominently generated.

FIG. 3A is a drawing showing a boundary between blocks to
which filters are applied. In the drawing, a center line indicates a
boundary between blocks, a pixel on the right side shown as Q
indicates a pixel adjacent to the boundary in a target block, and a
pixel on the left side shown as P indicates a pixel adjacent to the
boundary in an adjacent block. FIG. 3B is a table showing which
filter is selected depending on the conditions of the pixel P and the
pixel Q shown in FIG. 3A. For example, the Filter 4 is selected in the
case where the boundary is at the vertical edge and one of the pixel
P and the pixel Q belongs to a block which is intra prediction
encoded.

Note that the table of FIG. 3B only shows an example of a
method of selecting filters, and the number of filters and the
selection conditions are not limited to the example. Therefore,
other cases can be similarly treated.

The information about the type of filter determined by the
mentioned processing is used for applying filtering on the target
pixel of the luminance component, while being stored in the memory
region in which the information can be referred to in later
processing.

Next, in order to perform deblocking on the chrominance
components, a loop processing is repeated as many times as the
number of pixels of the chrominance components adjacent to the
target block boundary (F6 and F10); a type of filter is selected in
each loop (F8); and the selected filter is applied (F9). Herein, a
filter to be applied to the chrominance component is determined
based on the type of filter used for the luminance component.
Specifically, the type of filter applied at a pixel position of the
corresponding luminance component is referred to and used
from the memory region in which the information about
the type of filter determined in the processing for the luminance
component is stored. Herein, the pixel position of the target
chrominance component is converted to the pixel position of the
corresponding luminance component (F7a).

The conversion of information about the position of a pixel is
performed so as to have the position of the pixel of the luminance
component assigned to a same field as a target pixel of the
chrominance component when the target block is converted to the
field structure. Here, since the chrominance components are
decimated to the half amount in both vertical and horizontal
directions, the conversion is performed using the following
equations. Note that the marks indicate the
following: XL indicates a horizontal coordinate value of the
luminance; XC indicates a horizontal coordinate value of the
chrominance; YL indicates a vertical coordinate value of the
luminance; and YC indicates a vertical coordinate value of the
chrominance. Further, the mark % in the equation 2(b) indicates
an operator which returns the remainder when division is
performed.

XL = 2 × XC (equation 2(a))

YL = 2 × YC + YC % 2 (equation 2(b))
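
As a minimal sketch of equations 2(a) and 2(b), the conversion can be written as below. The function names chroma_to_luma_420 and field_of are hypothetical, and the assignment of even lines to the top field and odd lines to the bottom field is an assumption consistent with the explanation of FIG. 9B.

```python
from typing import Tuple


def chroma_to_luma_420(xc: int, yc: int) -> Tuple[int, int]:
    """Convert a chrominance pixel position to the luminance position to refer to."""
    xl = 2 * xc            # equation 2(a)
    yl = 2 * yc + yc % 2   # equation 2(b): % returns the remainder of the division
    return xl, yl


def field_of(line: int) -> str:
    """Assumed field assignment: even lines are the top field, odd lines the bottom."""
    return "top" if line % 2 == 0 else "bottom"
```

With these definitions, the chrominance line C_1 is converted to the luminance line L_3, and every chrominance line is converted to a luminance line of the same field, since an odd YC always yields an odd YL and an even YC an even YL.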

By applying the filter determined by the mentioned processes,
deblocking is performed on the chrominance components. For each
block, a boundary formed of a left vertical edge and a boundary
formed of an upper horizontal edge are targeted. Therefore, in
the case where the luminance components form a block made up
of, for example, horizontal 4 × vertical 4 pixels, the chrominance
components form a block of horizontal 2 × vertical 2 pixels because
this example decimates the chrominance components to the half
amount in both vertical and horizontal directions. Consequently,
the mentioned processing is repeated four times.

FIGS. 9A and 9B are drawings for explaining a relationship
between positions of pixels of target chrominance components and
positions of pixels of reference luminance components that are
converted using the equations 2(a) and 2(b). FIG. 9A shows a
positional relationship between the luminance components and the
chrominance components in the frame structure. FIG. 9B shows a
positional relationship between the luminance components and the
chrominance components in the case where the image is converted
to the field structure. Here, L_0 indicates a position of a luminance
component at the zeroth line and C_0 indicates a position of a
chrominance component at the zeroth line. In the case where a
deblocking filter is applied to the chrominance components of C_1,
the equation 2(b) indicates that the luminance components of L_3
are referred to. Converting the pixels herein to the field
structure, it is found that the chrominance components of C_1 in the
bottom field refer to the luminance components of L_3 in the same
bottom field so as to determine a type of filter.

In the conventional method, as explained using FIGS. 7A
and 7B, in the case where the deblocking filter is applied to the
chrominance components of C_1, the luminance components of L_2
are referred to using the equation 1(b). In the
deblocking, there is a possibility that the type of applied filter
differs for each field in the case where a target block is in a frame
structure but the adjacent block is in a field structure. That is, the
result largely differs, for the filter to be applied to the chrominance
components C_1, depending on whether the filter used for the
luminance components in the top field or the filter used for the
luminance components in the bottom field is used. Consequently, a
gap is generated in the degree of compensation of a pixel value
because the filter to be used differs between the luminance
components and the chrominance components in the bottom field.

However, by using a conversion method according to the
present invention, the same type of deblocking filter is used for the
luminance component and the chrominance component which
make up a pixel in the same field. Therefore, it is possible to
perform encoding and decoding so as to generate a consistent image
when the image is displayed in the interlaced-scan form.
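
The difference between the two conversions can be illustrated with a short sketch. Equation 1(b) is not reproduced in this part of the description, so the conventional conversion is assumed here to be YL = 2 × YC, which is consistent with C_1 referring to L_2 as stated above. The function names are hypothetical, and odd lines are assumed to belong to the bottom field.

```python
from typing import Callable, List


def conventional_row(yc: int) -> int:
    """Assumed form of equation 1(b): C_1 refers to L_2."""
    return 2 * yc


def proposed_row(yc: int) -> int:
    """Equation 2(b): C_1 refers to L_3."""
    return 2 * yc + yc % 2


def is_bottom_field(line: int) -> bool:
    """Assumed field assignment: odd lines belong to the bottom field."""
    return line % 2 == 1


def mismatched_rows(convert: Callable[[int], int], rows: int) -> List[int]:
    """Chrominance lines whose referenced luminance line lies in the other field."""
    return [yc for yc in range(rows)
            if is_bottom_field(yc) != is_bottom_field(convert(yc))]
```

Under these assumptions, mismatched_rows(conventional_row, 8) lists every chrominance line of the bottom field, whereas mismatched_rows(proposed_row, 8) is empty.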

Here, instead of using the equations 2(a) and 2(b) for
converting the pixel position of a target chrominance component
into the pixel position of a corresponding luminance component, the
following equations can also be used. Note that XL indicates a
horizontal coordinate value of the luminance; XC indicates a
horizontal coordinate value of the chrominance; YL indicates a
vertical coordinate value of the luminance; and YC indicates a
vertical coordinate value of the chrominance.

XL = 2 × XC (equation 2(a1))

YL = 2 × YC (equation 2(b1))

XL = 2 × XC (equation 2(a2))

YL = 2 × YC + 1 (equation 2(b2))

In the case where the chrominance component belongs to the
top field, a conversion is performed using the equations 2(a1) and
2(b1), and in the case where the chrominance component belongs to
the bottom field, a conversion is performed using the equations
2(a2) and 2(b2).
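
The field-dependent conversion can be sketched as below. The function name chroma_to_luma_by_field is hypothetical, the horizontal conversion is assumed to be the same for both fields, and YC is assumed to be a line number in the frame structure, so that even chrominance lines belong to the top field and odd lines to the bottom field.

```python
from typing import Tuple


def chroma_to_luma_by_field(xc: int, yc: int, field: str) -> Tuple[int, int]:
    """Convert a chrominance position using the equations selected by its field."""
    if field == "top":
        return 2 * xc, 2 * yc        # equations 2(a1) and 2(b1)
    if field == "bottom":
        return 2 * xc, 2 * yc + 1    # horizontal part assumed unchanged; 2(b2)
    raise ValueError("field must be 'top' or 'bottom'")
```

Under these assumptions the result is identical to that of the equations 2(a) and 2(b): for a top-field line the remainder YC % 2 is 0, and for a bottom-field line it is 1.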

(2) half decimation in a horizontal direction
FIG. 10A is a flowchart for explaining a flow of deblocking
processing in the case where an image in which the chrominance
components are decimated to the half amount only in a horizontal
direction is inputted. It differs from FIG. 8 only in the deblocking
for the chrominance components. Therefore, explanation of
the other, identical processing is omitted here.

In order to perform deblocking on the chrominance
components, a loop processing is repeated as many times as the
number of pixels of the chrominance components adjacent to the
target block boundary (F6 and F10); a type of filter is selected in
each loop (F8); and the selected filter is applied (F9). Herein, a
filter to be applied to the chrominance component is determined
based on the type of filter used for the luminance component.
Specifically, the type of filter applied at the pixel position of the
corresponding luminance component is referred to and used from
the memory region in which the information about the
type of filter determined by the processing performed on the
luminance component is stored. Herein, the pixel position of the
target chrominance component is converted to the pixel position of
the corresponding luminance component (F7b).

The conversion of information about a position of a pixel is
performed so as to have a position of a pixel of a luminance
component assigned to a same field as a field assigned to the
pixel of the target chrominance component when the target block is
converted to the field structure. Here, the chrominance components
are decimated to the half amount only in a horizontal direction.
Therefore, the conversion is performed using the following
equations. Note that XL indicates a horizontal coordinate value of
the luminance; XC indicates a horizontal coordinate value of the
chrominance; YL indicates a vertical coordinate value of the
luminance; and YC indicates a vertical coordinate value of the
chrominance.

XL = 2 × XC (equation 3(a))

YL = YC (equation 3(b))

By applying the filter determined by the processing, the
deblocking is performed on the chrominance components. For each
block, a boundary made up of a vertical edge on the left side and a
boundary made up of a horizontal edge on the upper side are
targeted. Therefore, in the case where the luminance component is
a block that is, for example, made up of horizontal 4 × vertical 4
pixels, the chrominance component is horizontal 2 × vertical 4 pixels
because this example decimates the chrominance
components to the half amount only in the horizontal direction.
Consequently, the above processing is repeated six times.
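
The repetition counts can be checked with a small sketch, assuming that one repetition is performed for each chrominance pixel adjacent to the left vertical edge and to the upper horizontal edge of the block. The function names chroma_block_size and filter_repetitions, and the decimation factors passed to them, are hypothetical.

```python
from typing import Tuple


def chroma_block_size(luma_w: int, luma_h: int, h_div: int, v_div: int) -> Tuple[int, int]:
    """Chrominance block size for given horizontal and vertical decimation factors."""
    return luma_w // h_div, luma_h // v_div


def filter_repetitions(width: int, height: int) -> int:
    """One repetition per pixel on the left edge (height) and on the upper edge (width)."""
    return height + width
```

For a luminance block of horizontal 4 × vertical 4 pixels, this gives six repetitions for the chrominance block of horizontal 2 × vertical 4 pixels, compared with four for a block of horizontal 2 × vertical 2 pixels and eight for the luminance block itself.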

FIGS. 11A and 11B are drawings for explaining a
relationship between positions of pixels of target chrominance
components and positions of pixels of reference luminance
components converted using the equations 3(a) and 3(b). FIG. 11A
shows a positional relationship between the luminance components
and the chrominance components in the frame structure. FIG. 11B
shows a positional relationship between the luminance components
and the chrominance components in the case of converting the image
to the field structure. Here, L_0 indicates a position of the
luminance components at the zeroth line and C_0 indicates a
position of the chrominance components at the zeroth line. In the
case of applying the deblocking filter to the chrominance
components of C_1, the equation 3(b) indicates that the luminance
components of L_1 are referred to. Converting the pixels herein to
the field structure, it is found that the type of filter for the
chrominance components of C_1 in the bottom field is determined by
referring to the luminance components of L_1 in the same bottom
field.

Using the mentioned conversion method, the same type of
deblocking filter is applied to the luminance components and the
chrominance components which make up a same pixel in the same
field. Therefore, it is possible to perform encoding and decoding so
as to generate a consistent image when the image is displayed in the
interlaced-scan form.

(3) no decimation
FIG. 10B is a flowchart for explaining a flow of deblocking
processing in the case where an image in which chrominance
components are not decimated is inputted. It differs from FIG. 8
only in the deblocking performed on the chrominance components.
Therefore, explanation of the other, identical processing
is omitted here.

In order to perform deblocking on the chrominance
components, a loop processing is repeated as many times as the
number of pixels of the chrominance components adjacent to the
target block boundary (F6 and F10); a type of filter is selected in
each loop (F8); and the selected filter is applied (F9). Herein, a
filter to be applied to the chrominance components is determined
based on the type of filter used for the luminance components.
That is, the type of filter applied at the pixel position of the
corresponding luminance component is referred to and used from
the memory region in which the information about the type of filter
determined by the processing performed on the luminance
component is stored. Herein, the pixel position of the target
chrominance component is converted to the pixel position of the
corresponding luminance component (F7c).

The conversion of information about a position of a pixel is
performed so as to have a position of a pixel of a luminance
component assigned to the same field as a field assigned to a pixel
of the chrominance component to be targeted when the target block
is converted to the field structure. Here, since the chrominance
components are not decimated, the conversion is performed using
the following equations. Note that the marks indicate the following:
XL indicates a horizontal coordinate value of the luminance; XC
indicates a horizontal coordinate value of the chrominance; YL
indicates a vertical coordinate value of the luminance; and YC
indicates a vertical coordinate value of the chrominance.

XL = XC (equation 4(a))

YL = YC (equation 4(b))

By applying the filter determined by the mentioned
processing, the deblocking is performed on the chrominance
components. For each block, a boundary made up of a vertical
edge on the left side and a boundary made up of a horizontal edge
on the upper side are targeted. Therefore, in the case where the
luminance component is a block of, for example, horizontal 4 ×
vertical 4 pixels, the chrominance component is horizontal 4 ×
vertical 4 pixels because in this example the
chrominance components are not decimated. Consequently, the
processing is repeated eight times.
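
Taken together, the position conversions for the three decimation cases and the reference to the stored filter type might be sketched as follows. This is a hedged illustration only: the names CONVERSIONS, chroma_filter_type and filter_memory, the string labels for the three cases, and the memory key of (edge, x, y) are all hypothetical.

```python
from typing import Callable, Dict, Tuple

Convert = Callable[[int, int], Tuple[int, int]]

# Chrominance-to-luminance position conversion for each decimation case.
CONVERSIONS: Dict[str, Convert] = {
    "half_both": lambda xc, yc: (2 * xc, 2 * yc + yc % 2),  # equations 2(a), 2(b)
    "half_horizontal": lambda xc, yc: (2 * xc, yc),         # equations 3(a), 3(b)
    "none": lambda xc, yc: (xc, yc),                        # equations 4(a), 4(b)
}


def chroma_filter_type(
    filter_memory: Dict[Tuple[str, int, int], int],
    decimation: str,
    edge: str,
    xc: int,
    yc: int,
) -> int:
    """Look up the filter type stored for the corresponding luminance pixel."""
    xl, yl = CONVERSIONS[decimation](xc, yc)   # F7a, F7b or F7c
    return filter_memory[(edge, xl, yl)]       # F8: reuse the luminance filter type
```

In all three cases the converted vertical position has the same parity as YC, so the referenced luminance pixel lies in the same field as the target chrominance pixel.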

FIGS. 12A and 12B are drawings for explaining a
relationship between positions of pixels of target chrominance
components and the positions of pixels of reference luminance
components that are converted using the equations 4(a) and 4(b).
FIG. 12A shows a positional relationship between luminance
components and chrominance components in the frame structure.
FIG. 12B shows a positional relationship between the luminance
components and the chrominance components in the case where the
image is converted to the field structure. Here, L_0 indicates a
position of luminance components at the zeroth line; and C_0
indicates a position of chrominance components at the zeroth line.
In the case where the deblocking filter is applied to the chrominance
components of C_1, the equation 4(b) indicates that the luminance
components of L_1 are referred to. Converting the pixels herein to
the field structure, it is found that the type of filter applied to the
chrominance component of C_1 in the bottom field is determined by
referring to the luminance component of L_1 in the same bottom
field.

Using the mentioned conversion method, the same type
of deblocking filter is applied to the luminance components
and the chrominance components which make up a same pixel in the
same field. Therefore, it is possible to perform encoding and
decoding so as to generate a consistent image in the case where the
image is displayed in the interlaced-scan form.

(Second Embodiment)
In addition, by recording a program for realizing
the moving image encoding method or the moving image decoding
method as shown in each of the above-mentioned embodiments on
a recording medium such as a flexible disk, it becomes possible to
perform the processing as shown in the above embodiment easily in
an independent computer system.

FIGS. 13A-13C are diagrams of a recording medium for
recording a program for realizing the moving image encoding
method and the moving image decoding method in the above
embodiment in the computer system.

FIG. 13B shows the front view of a flexible disk and the
schematic cross-section, as well as the flexible disk itself, whereas
FIG. 13A shows an example of a physical format of the flexible disk
as a recording medium body. A flexible disk FD is contained in a
case F, a plurality of tracks Tr are formed concentrically on the
surface of the disk in the radial direction from the periphery, and
each track is divided into 16 sectors Se in the angular direction.
Therefore, in the flexible disk storing the above-mentioned program,
the moving image encoding method as the above program is recorded
in an area allocated for it on the above flexible disk FD.

In addition, FIG. 13C shows the configuration for recording
and reproducing the program on and from the flexible disk FD.
When the program is recorded on the flexible disk FD, the computer
system Cs writes the moving image encoding method and the
moving image decoding method as the program on the flexible disk
FD via a flexible disk drive. When the above moving image
encoding method and the moving image decoding method are
constructed in the computer system using the program recorded on
the flexible disk, the program is read out from the flexible disk via
the flexible disk drive and transferred to the computer system.

Note that the above explanation is made on an assumption
that a recording medium is a flexible disk, but the same processing
can also be performed using an optical disk. In addition, the
recording medium is not limited to these, and any other medium
such as a CD-ROM, a memory card, and a ROM cassette can be used
in the same manner if a program can be recorded on it.

Further, the applications of the moving image encoding
apparatus and the moving image decoding apparatus illustrated in
the above embodiments and a system using them shall be explained
here.

FIG. 14 is a diagram showing the overall configuration of a
content providing system ex100 for realizing content distribution
service. The area for providing communication service is divided
into cells of desired size, and base stations ex107 to ex110 which
are fixed wireless stations are placed in respective cells.

In this content providing system ex100, various devices such
as a computer ex111, a personal digital assistant (PDA) ex112, a
camera ex113, a cell phone ex114 and a camera-equipped cell
phone ex115 are connected to the Internet ex101, via an Internet
service provider ex102, a telephone network ex104 and base
stations ex107 to ex110, for example.

However, the content providing system ex100 is not limited to
the combination as shown in FIG. 14, and may be connected to a
combination of any of them. Also, each device may be connected
directly to the telephone network ex104, not through the base
stations ex107 to ex110 which are the fixed wireless stations.

The camera ex113 is a device such as a digital video camera
capable of shooting moving images. The cell phone may be any of
a cell phone of a Personal Digital Communications (PDC) system, a
Code Division Multiple Access (CDMA) system, a Wideband-Code
Division Multiple Access (W-CDMA) system or a Global System for
Mobile Communications (GSM) system, a Personal Handy-phone
System (PHS) and the like.

Also, a streaming server ex103 is connected to the camera
ex113 via the base station ex109 and the telephone network ex104,
which realizes live distribution or the like using the camera ex113
based on the encoded data transmitted from the user. The
encoding of the data shot by the camera may be performed by the
camera ex113, the server for transmitting the data, or the like.

Also, the moving image data shot by a camera ex116 may be
transmitted to the streaming server ex103 via the computer ex111.
The camera ex116 is a device such as a digital camera capable of
shooting still and moving images. In this case, either the computer
ex111 or the camera ex116 may encode the moving image data.
An LSI ex117 included in the computer ex111 or the camera ex116
performs the encoding processing. Note that software for encoding
and decoding moving images may be integrated into any type of
recording medium (such as a CD-ROM, a flexible disk and a hard
disk) that is readable by the computer ex111 or the like.
Furthermore, the camera-equipped cell phone ex115 may transmit
the moving image data. This moving image data is the data
encoded by the LSI included in the cell phone ex115.

In this content providing system ex100, contents (such as a
video of a live music performance) shot by users using the camera
ex113, the camera ex116 or the like are encoded in the same
manner as in the above embodiments and transmitted to the
streaming server ex103, while the streaming server ex103 makes
stream distribution of the above content data to the clients at their
request. The clients include the computer ex111, the PDA ex112,
the camera ex113, the cell phone ex114, and the like, capable of
decoding the above-mentioned encoded data. The content
providing system ex100 is a system in which the clients can thus
receive and reproduce the encoded data, and further can receive,
decode and reproduce the data in real time so as to realize personal
broadcasting.

When each apparatus in this system performs encoding or
decoding, the moving image encoding apparatus or the moving
image decoding apparatus shown in each of the above-mentioned
embodiments may be used.

A cell phone is now explained as an example thereof. 

FIG. 15 is a diagram showing a cell phone ex115 which uses
the moving image encoding method and the moving image decoding
method as explained in the above embodiments. The cell phone
ex115 has an antenna ex201 for exchanging radio waves with
the base station ex110, a camera unit ex203 such as a CCD camera
capable of shooting moving and still images, a display unit ex202
such as a liquid crystal display for displaying the data obtained by
decoding video shot by the camera unit ex203, video received by the
antenna ex201, or the like, a main body including a set of operation
keys ex204, a voice output unit ex208 such as a speaker for
outputting voices, a voice input unit ex205 such as a microphone for
inputting voices, a recording medium ex207 for storing encoded or
decoded data, such as data of moving or still images shot by the
camera, and data of text, moving images or still images of received
e-mails, and a slot unit ex206 for attaching the recording medium
ex207 to the cell phone ex115. The recording medium ex207
includes a flash memory element, a kind of Electrically Erasable and
Programmable Read Only Memory (EEPROM) that is an electrically
rewritable and erasable nonvolatile memory, in a plastic case such
as an SD card.

Further, the cell phone ex115 is explained with reference to
FIG. 16. In the cell phone ex115, a power supply circuit unit ex310,
an operation input control unit ex304, an image encoding unit ex312,
a camera interface unit ex303, a Liquid Crystal Display (LCD)
control unit ex302, an image decoding unit ex309, a
multiplex/demultiplex unit ex308, a record/reproduce unit ex307, a
modem circuit unit ex306 and a voice processing unit ex305 are
connected to a main control unit ex311, and to each other, via a
synchronous bus ex313. The main control unit ex311 performs
overall control of each unit of the main body including the display
unit ex202 and the operation keys ex204.

When a call-end key or a power key is turned ON by a user's
operation, the power supply circuit unit ex310 supplies the
respective units with power from a battery pack so as to activate the
camera-equipped digital cell phone ex115 to a ready state.

In the cell phone ex115, under the control of the main control
unit ex311 including a CPU, ROM, RAM and the like, the voice
processing unit ex305 converts the voice signals received by the
voice input unit ex205 in conversation mode into digital voice data,
the modem circuit unit ex306 performs spread spectrum processing
of the digital voice data, and the communication circuit unit ex301
performs digital-to-analog conversion and frequency transformation
of the data, so as to transmit the result via the antenna ex201.

Also, in the cell phone ex115, the communication circuit unit
ex301 amplifies the data received by the antenna ex201 in
conversation mode and performs frequency transformation and
analog-to-digital conversion; the modem circuit unit ex306
performs inverse spread spectrum processing of the data; and the
voice processing unit ex305 converts it into analog voice data, so as
to output the result via the voice output unit ex208.
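
The specification does not state which spread spectrum scheme the
modem circuit unit ex306 uses. Purely as an illustration, the
following Python sketch assumes direct-sequence spreading with a
made-up 8-chip code. The names CHIP_CODE, spread and despread are
hypothetical and do not appear in the specification.

```python
# Illustrative only: direct-sequence spreading is an ASSUMPTION,
# and CHIP_CODE is an arbitrary example code, not one from the specification.
CHIP_CODE = [1, -1, 1, 1, -1, 1, -1, -1]


def spread(bits, code=CHIP_CODE):
    """Spread each data bit over len(code) chips (bit 1 -> +code, bit 0 -> -code)."""
    chips = []
    for bit in bits:
        symbol = 1 if bit else -1
        chips.extend(symbol * c for c in code)
    return chips


def despread(chips, code=CHIP_CODE):
    """Inverse processing: correlate each code-length block with the code."""
    n = len(code)
    if len(chips) % n != 0:
        raise ValueError("chip sequence length must be a multiple of the code length")
    bits = []
    for start in range(0, len(chips), n):
        block = chips[start:start + n]
        correlation = sum(x * c for x, c in zip(block, code))
        bits.append(1 if correlation > 0 else 0)
    return bits


if __name__ == "__main__":
    voice_bits = [1, 0, 1, 1]
    sent = spread(voice_bits)
    print(despread(sent) == voice_bits)
```

In this sketch the receiver recovers each bit from the sign of the
correlation, so a single corrupted chip within a block does not
change the decoded bit.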

Furthermore, when transmitting an e-mail in data
communication mode, the text data of the e-mail inputted by
operating the operation keys ex204 of the main body is sent out to
the main control unit ex311 via the operation input control unit
ex304. In the main control unit ex311, after the modem circuit unit
ex306 performs spread spectrum processing of the text data and the
communication circuit unit ex301 performs a digital-to-analog
conversion and frequency transformation for the text data, the data
is transmitted to the base station ex110 via the antenna ex201.

When image data is transmitted in data communication mode,
the image data shot by the camera unit ex203 is provided to the
image encoding unit ex312 via the camera interface unit ex303.
When the image data is not transmitted, the image data shot by the
camera unit ex203 can also be displayed directly on the display unit
ex202 via the camera interface unit ex303 and the LCD control unit
ex302.

The image encoding unit ex312, including the moving image
encoding apparatus explained in the present invention, compresses
and encodes the image data provided from the camera unit ex203 by
the encoding method used for the moving image encoding apparatus
as shown in the above-mentioned embodiments so as to transform it
into encoded image data, and sends it out to the
multiplex/demultiplex unit ex308. At this time, the cell phone
ex115 sends out the voices received by the voice input unit ex205
during the shooting by the camera unit ex203, to the
multiplex/demultiplex unit ex308 as digital voice data via the voice
processing unit ex305.

The multiplex/demultiplex unit ex308 multiplexes the
encoded image data provided from the image encoding unit ex312
and the voice data provided from the voice processing unit ex305
using a predetermined method; the modem circuit unit ex306
then performs spread spectrum processing of the resulting
multiplexed data; and the communication circuit unit ex301
performs digital-to-analog conversion and frequency transformation
on the result for transmission via the antenna ex201.

When receiving data of a moving image file which is linked to
a website or the like in data communication mode, the modem
circuit unit ex306 performs inverse spread spectrum processing of
the data received from the base station ex110 via the antenna
ex201, and sends out the multiplexed data obtained as the result to
the multiplex/demultiplex unit ex308.

In order to decode the multiplexed data received via the
antenna ex201, the multiplex/demultiplex unit ex308 demultiplexes
the multiplexed data into an encoded bit stream of image data and
an encoded bit stream of voice data, and provides the encoded
image data to the image decoding unit ex309 and the voice data to
the voice processing unit ex305, respectively, via the synchronous
bus ex313.
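
The "predetermined method" of multiplexing is not specified. As a
minimal sketch only, the following Python code assumes a simple
packet format (a one-byte stream identifier followed by a two-byte
payload length) to show how encoded image data and voice data could
be interleaved into one stream and separated again. The constants
IMAGE and VOICE, the packet layout and the function names are
hypothetical and are not taken from the specification.

```python
import struct

# Hypothetical stream identifiers; the specification defines no such values.
IMAGE, VOICE = 0, 1

# Assumed header layout: 1-byte stream id + 2-byte big-endian payload length.
HEADER = ">BH"
HEADER_SIZE = struct.calcsize(HEADER)


def multiplex(packets):
    """Interleave (stream_id, payload) pairs into a single byte string."""
    out = bytearray()
    for stream_id, payload in packets:
        out += struct.pack(HEADER, stream_id, len(payload))
        out += payload
    return bytes(out)


def demultiplex(data):
    """Split a multiplexed byte string back into per-stream payload lists."""
    streams = {IMAGE: [], VOICE: []}
    offset = 0
    while offset < len(data):
        stream_id, length = struct.unpack_from(HEADER, data, offset)
        offset += HEADER_SIZE
        streams[stream_id].append(data[offset:offset + length])
        offset += length
    return streams


if __name__ == "__main__":
    muxed = multiplex([(IMAGE, b"img0"), (VOICE, b"v0"), (IMAGE, b"img1")])
    separated = demultiplex(muxed)
    print(separated[IMAGE], separated[VOICE])
```

Under this assumed layout, the image payloads would go to the image
decoding unit ex309 and the voice payloads to the voice processing
unit ex305.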

Next, the image decoding unit ex309, including the moving
image decoding apparatus explained in the present invention,
decodes the encoded bit stream of the image data using the
decoding method corresponding to the encoding method as shown in
the above-mentioned embodiments, so as to generate reproduced
moving image data, and provides this data to the display unit ex202
via the LCD control unit ex302, and thus moving image data
included in a moving image file linked to a website, for instance, is
displayed. At the same time, the voice processing unit ex305
converts the voice data into analog voice data, and provides this
data to the voice output unit ex208, and thus voice data included in
the moving image file linked to a website, for instance, is
reproduced.

The present invention is not limited to the above-mentioned
system. Satellite and ground-based digital broadcasting have
recently attracted attention, and at least either the moving image
encoding apparatus or the moving image decoding apparatus in the
above-mentioned embodiments can be incorporated into a digital
broadcasting system as shown in FIG. 17. More specifically, an
encoded bit stream of video information is transmitted from a
broadcast station ex409 to a communication or broadcast satellite
ex410 via radio waves. Upon receipt of it, the broadcast satellite
ex410 transmits radio waves for broadcasting, a home antenna
ex406 with a satellite broadcast reception function receives the
radio waves, and an apparatus such as a television (receiver) ex401
or a Set Top Box (STB) ex407 decodes the encoded bit stream for
reconstruction. The moving image decoding apparatus as shown in
the above-mentioned embodiments can be implemented in a
reproduction device ex403 for reading and decoding an encoded bit
stream recorded on a storage medium ex402, which is a recording
medium such as a CD or DVD. In this case, the reproduced video
signals are displayed on a monitor ex404. It is also conceivable to
implement the moving image decoding apparatus in the set top box
ex407 connected to a cable ex405 for cable television or the antenna
ex406 for satellite and/or ground-based broadcasting so as to
reproduce the video signals on a monitor ex408 of the television.
The moving image decoding apparatus may be incorporated into the
television, not in the set top box. Also, a car ex412 having an
antenna ex411 can receive signals from the satellite ex410, the base
station ex107 or the like, and reproduce moving images on a display
device such as a car navigation system ex413 in the car ex412.

Furthermore, the moving image encoding apparatus as shown
in the above-mentioned embodiments can encode image signals and
record them on a recording medium. As a concrete example, there
is a recorder ex420 such as a DVD recorder for recording image
signals on a DVD disk ex421 and a disk recorder for recording them
on a hard disk. They can also be recorded on an SD card ex422. If
the recorder ex420 includes the moving image decoding apparatus
as shown in the above-mentioned embodiments, the image signals
recorded on the DVD disk ex421 or the SD card ex422 can be
reproduced for display on the monitor ex408.

As for the configuration of the car navigation system ex413,
a configuration that omits the camera unit ex203, the camera
interface unit ex303 and the image encoding unit ex312 from the
units shown in FIG. 16 is conceivable. The same applies to the
computer ex111, the television (receiver) ex401 and others.

Moreover, three types of implementations can be conceived
for a terminal such as the above-mentioned cell phone ex114: a
communication terminal equipped with both an encoder and a
decoder, a sending terminal equipped with an encoder only, and a
receiving terminal equipped with a decoder only.

As described above, it is possible to use the moving image
encoding method or the moving image decoding method described
in the above embodiments in any of the above-mentioned devices and
systems, and thus the effects explained in the above embodiments
can be obtained.

Also, the present invention is not limited to the
above-mentioned embodiments, and various modifications and
corrections can be made within the scope of the present invention.

Further, each functional block shown in the block diagrams
FIG. 1 and FIG. 2 is typically realized as an LSI, which is an
integrated circuit. This LSI may be constructed in one chip or in a
plurality of chips (for example, the functional blocks other than
memory may be constructed in one chip). Here, it is called an LSI.
However, it may also be called an IC, a system LSI, a super LSI, or
an ultra LSI depending on the integration density.

In addition, the method of constructing an integrated circuit is
not limited to the LSI; it may be realized as a dedicated circuit
or as a general-purpose processor. A Field Programmable Gate Array
(FPGA) that can be programmed after the LSI is manufactured, or a
reconfigurable processor in which the connections and settings of
circuit cells in the LSI can be reconfigured, may also be used.

Furthermore, if a technology for constructing an integrated
circuit that replaces the LSI emerges from progress in
semiconductor technology or from another derived technology, the
functional blocks may of course be integrated using the new
technology. Application of biotechnology and the like is one such
possibility.

Industrial Applicability 

As described above, the moving image encoding method
and the moving image decoding method according to the present
invention are useful as methods for encoding each picture which
makes up an input image, outputting the result as moving image
encoded data, and decoding the moving image encoded data, using,
for example, a cell phone, a DVD apparatus, a personal computer and
the like.
ABSTRACT

In the case where a deblocking filter applied to a luminance
component is referred to and applied to a chrominance component,
a pixel position of the chrominance component is converted into a
pixel position of a luminance component which belongs to the same
field as the chrominance component (F7a), and that luminance
component is referred to. The present invention thereby makes it
possible to generate a more natural image by filtering the luminance
component and the chrominance component with the same strength
when the image is displayed in an interlaced-scan form.